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Abstract 

In this paper, we study the inner and outer boundary densities of some 
sets with self-similar boundary having Minkowski dimension s > d— 1 in 
These quantities turn out to be crucial in some problems of set estimation, 
as we show here for the Voronoi approximation of the set with a random 
input constituted by n iid points in some larger bounded domain. We prove 
that some classes of such sets have positive inner and outer boundary density, 
and therefore satisfy Berry-Esseen bounds in n~ s ^ 2d for Kolmogorov distance. 
The Von Koch flake serves as an example, and a set with Cantor boundary 
as a counter-example. We also give the almost sure rate of convergence of 
Hausdorff distance between the set and its approximation. 
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Notations We designate by d(.,.) the Euclidean distance between points 
or subsets of R d . The closure, the interior, the topological boundary and 
the diameter of a set A C R d are designated by cl (A), int(A), dA , diam(A) 
respectively. The open Euclidean ball with center x and radius r in R d is 
noted B(x, r). 

Given two sets A , B , we write A + B for {c 6 | c = a + b , a e A, b e B}. 

The Hausdorff distance between A and B is designated by dn(A, B ), that is 

d H (A, B ) = inf{r > 0 : A C B + B( 0, r), B C A + B( 0, r)}. 

Vol is the d-dimensional Lebesgue measure and Hd is the volume of the Eu¬ 
clidean unit ball. For s > 0, T~L S is the s-dimensional Hausdorff measure on 
R d . 

Throughout the paper, K c R d is a non-empty compact set with positive 
volume. The letters c, C are reserved to indicate positive constants that 
depend only on fixed parameters like K or d, and which value may change 
from line to line. 


Background 

Set estimation theory is a topic of nonparametric statistics where an unknown 
set K is estimated, based on partial random information. The random input 
generally consists in a finite sample x °f points, either IID variables [3, 19] 
or a Poisson point process mmm- Based on the information of which of 
those points belong or not to K , one can reconstruct a random approximation 
K x of K and study the asymptotic quality of the approximation. See the 
recent survey m Chap. 11] about related works in nonparametric statistics. 

The results generally require the set to be smooth in some sense. In the 
literature, the set under study is assumed to be convex esb m , r-convex 
mm, to have volume polynomial expansion [2], positive reach, or a (d— 1)- 
rectihable boundary [H]. Another class of regularity assumptions usually 
needed is that of sliding ball or rolling ball conditions (0IMII2S!). The most 
common form of this condition is that in every point x of the boundary, there 
must be a ball touching x and contained either in K, in A' c , or both. 

In those works, the random approximation model K x can be the union 
of balls centred in the points of x with well tuned radius going to 0, a level 
set of the sum of appropriately scaled kernels centred on the random points, 
or else. Recently, a different model has been used in stochastic geometry, 
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based on the Voronoi tessellation associated with y. One defines K x as the 
union of all Voronoi cells which centers lie in K, assuming that points of x 
fall indifferently inside and outside K , as K is unknown. This is equivalent 
to defining K x as the set of points that are closer to x O K than to x O K c . 

This elegant model presents practical advantages in set estimation. For 
volume estimation the bias and standard deviation rates of the Voronoi ap¬ 
proximation seem to be best among all estimators of which the authors are 
aware of, and hold under almost no assumption on K. Regarding shape es¬ 
timation, Voronoi approximation also consistently estimates K and dK in 
the sense of the Hausdorff distance (Proposition [3]), and here again conver¬ 
gence rates and necessary assumptions compare favourably to those of other 
estimators (see Theorem [f] and the following Remarks). 

An heuristic explanation of these features is that the estimator natu¬ 
rally fills in regions inside K where the sample y is sparse, without need for 
convexity-like assumptions on K [2Tj or parameter tuning [31 [8i Till. 

The reader will find a more formal presentation of Voronoi approximation 
along with a summary of existing results [5, ! T3[ Ei 20, [23] in Section [2j 


Approach and main results 


This work was inspired and is closely related to ra. in which a central limit 
theorem and variance asymptotics for Vo\(K x ) were obtained for binomial 
input under very weak assumptions on K. Here we slighlty enhance their 
central limit theorem by showing that Vo\(K x ) can be recentered by Vol(AT) 
instead of E(Vol(/l x )). Explicitly, for suitable K, we have for each e > 0 a 
constant C E> o such that 


sup 

teM 


/Vo 1 ( A- x J-V°I(A-) P{N>t) 
V \/Var(Vol(A- x J) ' 


< C,n-‘ /2d log(n) 1 -^‘ ,+ ‘ 


(1) 

where s is the Minkowski dimension of dli (see Section 1.2). 

We also show that with Poisson input we have the almost sure convergence 
rates for the Hausdorff distance 


c V linr inf 

n—>-+oo 

c V linr inf 

n—>•+OO 


(n~ l ln(n)) 1 /' i 
d H (dK,dK Xn ) 
(n _1 k^n)) 1 ^ 


A linr sup 

n^+oo 

V linr sup 

n —>-+oo 


dn(K, K\„) 

{n~ l hr(?r)) 1 / d ^ 


d H (dK,dK Xn ) 
(?z _1 ^(u)) 1 /^ 


( 2 ) 

(3) 
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thus answering a query raised in [T3] and extending the results obtained in 

0 . 

The assumptions on K necessary for §>■(§ and ([3]) to hold are worth 
of interest on their own. They are broad enough to allow for irregular K, 
a feature which few estimators possess and is useful in some applications 
(see |8j HU, and references therein). Also, they are not specific to Voronoi 
approximation, and might be crucial for other estimators. They are mainly 
concerned with the densities of K at radius r in x, defined by 

K( Vol(Jl fl B(x, r)) 

Jr K ’ Vol (B(x,r)) ’ 

f K* ( \ _ Vol(A' c fl B(x, r)) 

Ir {X} Vol {B{x,r)) ' 

For ease of notation, we shall simply write f r for fff and g r for f^ c , K being 
implicit in all of the paper. Boundary densities have already appeared in set 
estimation theory Mi, where a set K is said to be standard whenever 
f r ^ e on K for some fixed e > 0 and all small enough r. Here we shall 
prefer to specify inner standard since we are also interested in cases where 
the inequality g r > e holds. In the latter case, K is said to be outer standard , 
and if K is both inner and outer standard K will be said to be bi-standard. 
The condition on K for (|3]) to hold is essentially bi-standardness, which is a 
usual assumption in set estimation 0 Theorem 1], 

The requirement for (Jl| to hold seems to be new in set estimation theory. 
It consists in a positive lim inf as r —> 0 of the quantity 

Vol(dK r ) f Kc fr = Vol JdKr) L 9ri 

where dK r is the set of points within distance r from dK. See Assumption [2] 
and Proposition[2]for precise statements and equivalent assertions. The above 
quantity measures the interpenetration of K and Ii c along their common 
boundary, since the greater it is, the more homogenously K and K c are 
distributed along dK. This lead us to name the condition of Assumption [2] 
the boundary permeability condition. 

Study of densities on the boundary is also related with works in geometric 
measure theory. Points for which lim/ r (x) is 0 or 1 are considered resp. as 
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the measure-theoretic exterior and interior of K, while other points constitute 
d*K the essential boundary of K. Federer [TJ Th.3.61] proved that if K is 
a measurable set with finite measure-theoretic perimeter then f r —> 1/2 on 
most of the essential boundary. 

We address here the question of whether comparable results hold if dK is 
an irregular set, with self-similar features. In general, such boundaries have a 
Hausdorff dimension s > d— 1 and don’t have finite perimeter. But, because 
of self-similarity, the densities f r ,g r should nevertheless have continuous and 
somehow periodical fluctuations in r, and therefore a positive inhmum. This 
is confirmed by Theorem [lj which gives, for K with self-similar boundary, a 
set of conditions under which f r >e on the boundary uniformly in r > 0. It 
is even proved that a ball with radius cr for some c > 0 can be rolled inside or 
outside the boundary, staying within a distance r from the boundary, but not 
touching it (otherwise self-similar boundaries would be excluded). Theorem 
[ljapplies for instance to the Von Koch flake in dimension 2, which is therefore 
well-behaved under Voronoi approximation and satisfies Q, @ and @. 

Some sets with self-similar boundary do not fall under the scope of this 
result, and we also give example of a self-similar set A/antor with Cantor-like 
self-similar boundary not satisfying the boundary permeability condition. 
Simulations we ran suggest that this irregularity of A/antor’s boundary in¬ 
deed reflects on the behaviour of its Voronoi approximation and prevents 
the variance of the estimator from satisfying an asymptotic power law like 
in (10). This suggests that the boundary permeability condition is indeed 
significant in set estimation and not merely a contingent constraint due to 
the methods used to obtain 0. 


Plan 


The plan of the paper is as follows. In Section [lj we recall basic facts 
and definitions about self-similar sets, especially regarding upper and lower 
Minkowski contents. We then give conditions under which sets with self¬ 
similar boundaries are standard. Voronoi approximation is formally intro¬ 
duced in Section [2] We then derive the volume normal approximation for 
sets with well-behaved boundaries, as well as Hausdorff distance results. We 
also develop the counter example K c&ntor that satisfies neither the hypotheses 
of Theorem [I] nor the volume approximation variance asymptotics (10). 
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1 Self-similar sets 


1.1 Self-similar set theory 

This subsection contains a review of some classic results of self-similar set 
theory. A more precise treatment of the subject and most of the results 
stated here can be found in m Broadly speaking, a set is self-similar when 
arbitrarily small copies of the set can be found in the neighbourhood of any 
of its points. This suggests that a self-similar set should be associated with 
a family of similitudes. 

Let {(pi, i E 1} be a finite set of contracting similitudes. Such a set is called 
an iterated function system. Define the following set transformation 

ip : V(R d ) —> V(R d ) 

E —► U UE). 

i 

It is easily seen that ip is contracting for the Hausdorff metric, which happens 
to be complete on fC d , the class of non-empty compact sets of M. d . By a fixed 
point theorem, there is an unique set E E fC d satisfying ip(E) = E , which is 
by definition the self-similar set associated with the (pi. 

If there is a bounded open set U such as ip(U) = \J<pi(U) C U with the 
union disjoint, then necessarily E C cl(17) and the <pi are said to satisfy the 
open set condition. Schief proved in ra that we can pick U so that U fl E 
is not empty. This stronger assumption is referred to as the strong open set 
condition in the literature. 


The similarity dimension of E is the unique s satisfying 

A? = 1 


£ 


where A, is the stretching factor of (pi. When the open set condition holds, 
this similarity dimension is also the Hausdorff dimension and the Minkowski 
dimension of E. Furthermore, 77s upper and lower s-dimensional Minkowski 
contents (see Subsection 1.2) are finite and positive. This is an easy and 
probably known result, but since we have not found it explicitly stated and 
separately proven in the literature, we will do so here in Proposition [lj We 
will need the following classical lcnnnae, that we prove for completeness. 
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Lemma 1. Let (U,) be a collection of disjoint open sets in such that each 
Ui contains a ball of radius C\r and is contained in a ball of radius c^r. Then 
any ball of radius r intersects at most (1 + 2c2) d cf d of the sets cl (Ui). 

Proof. Let B be a ball of center x and radius r. If some cl (Up) intersects B 
then cl (Ui) is contained in the ball B' of center x and radius r(l + 2cf). If q 
different cl (Uf) intersect B then there are q disjoint balls of radius cir inside 
B' , and by comparing volumes q ^ (1 + 2c2) d cf d . □ 

Lemma 2. Suppose that E and the (pi satisfy the open set condition with 
U. Then for every r < 1 we can find a finite set A of similarities with 
ratios such that 

1. The are composites of the (pi. 

2. The &k(E) cover E. 

3. The $k{U) are disjoint. 

4. A s k = 1 where s is the similarity dimension of E. 

5. min,(Aj)r ^ A & < r for all k. 

Proof. We give an algorithmic proof. Initialise at step 0 with A = {Id}. 
At step n replace every <F e A with ratio greater than r by the similarities 
<F o ^A G I. Stop when the process becomes stationary, which will happen 
no later than step |"ln(r)/ln(max(Aj))~|. 

Obviously, point 1 is satisfied. We will prove the next three points by 
induction. At step 0, all of E is covered by the &k(E), the $*,(!/) are disjoint, 
and the A s k sum up to 1. The first property is preserved when $ is replaced 
by the o (p h since $(£’) = Q(ip(E)) = |J o (pi{E). Likewise, the o (pi(U) 
are disjoint one from each other because is one-to-one, and disjoint from 
the other $k(U) because (Jd>o (pfU) = <&(ip(U)) C which yields point 

3. For point 4 note that if $ has ratio A, then the o (pi have ratios AAj 
and A s = A s Af = ^(AAj) s so the sum of the Aj remains unchanged by 
the substitution. Finally, since r < 1, every final set of the process has an 
ancestor with ratio greater than r. This gives the lower bound for point 5; 
the upper bound comes from the fact that the process ends. □ 
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Remark 1 . The process in the proof of Lemma [2] is often resumed by the 
formula 

n n —1 

A = {<f) h O (j) i2 ... o fa J Y[ \ k < r ^ Yl A 4>- 

k= 1 k =1 


1.2 Minkowski contents of self-similar sets 


Recall that the s-dimensional lower Minkowski content of a non-empty bounded 
set E cR d can be defined as 


lim inf 


r>0 


Vol(£ + R(0,r)) 

fd—s 


Similarly, the s-dimensional upper Minkowski content of E is 


lim sup 

r>0 


Vol(£ + R(0,r)) 

y>d—S 


In this paper, when both contents are finite and positive, we will simply say 
that E has upper and lower Minkowski contents. This leaves no ambiguity 
on the choice of s, since in that case s is necessarily the Minkowski dimension 
of E, i.e 

J r ln(Vol(£ + R(0,r))) 

s = a — hm-—-• 

r-> o m(r) 

We show below that self-similar sets always have upper and lower Minkowski 
contents. One can find an alternative proof for the lower content in [ 12l 
Paragraph 2.4], it can also be considered a consequence of 1-L S (E) > 0, like 
suggested in |18] . 


Proposition 1 . Let E be a self-similar set satisfying the open set condition 
with similarity dimension s. Then E has finite positive s-dimensional upper 
and lower Minkowski contents. 

Proof. As before, let fi be the generating similarities of E, A* their ratios, 
-0 : A i— > [J <fi(A) the associated set transformation, and U the open set of 
the open set condition. Choose any 0 < r < 1 and define the A k as in 
Lemma|2} Finally, write E k = &k(E),Uk = &k(U). 





We approximate E + 5(0, r) by the sets E k + 5(0, r), who are similar to 
the E + <h^ 1 (5(0,r)). By construction $^ 1 (5(0,r)) is a ball with a radius 
belonging to [1, (min* A,)' 1 ], so that 

Vol(5(0,1)) + Vol(5 + <h“ 1 (5(0,r))) + Vol(5(0, diam(5) + (rniriA^ 1 )), 

i 

because E is not empty. Applying <1+ gives 

cA d k + Vol(5 fc + 5(0, r)) + CA d k . 

Since 5 + 5(0, r) C [J k E k + 5(0, r) and A| = 1 we immediately get 
the upper bound 

Vol(5 + 5(0, r)) + Vol ( Ek + r )) 

^ Y CA " 

+ C^A s k r d ~ s 
< Cr d ~ s . 


For the lower bound we apply Lemma [I] to the disjoint U k . Since U is 
open we can put some ball of radius C\ in 5, and conversely we can put U 
in some ball of radius C 2 , since U is bounded. This means that each of the 
Uk contains a ball of radius r miry(A ? ;)ci ^ A k c,\ and is contained in a ball of 
radius rc 2 + A fc c 2 . So for any x G E + 5(0, r), B(x,r) intersects at most 
q of the E k (since E k C cl(E4)) with q a positive integer independent of r 


and x. This can be rewritten l_B+s(o,r) 



y^lgfc+Bio.r)- Integrating we get 


Vol(5 + 5(0,r)) ^ 


1 

Q 


Vol(5fc + 5(0, r)) so that 


Vol(5 + 5(0, r))> Y Vol(5 fc + B{ 0, r)) 
q 


c 

> - 


^ -(nhn A i) d - s 
q i 

> cr d ~ s . 


5 >: 


S r^d—S 


□ 
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1.3 Boundary regularity 


In order to formulate our result, we introduce the notion of proper and im¬ 
proper points. A point x € M. d is proper to K if Vol(0 fl K) > 0 for every 
neighbourhood O of x, it is improper to K otherwise. In other words, the 
set K pTop of proper points of K is the support of the measure Vol(ih fl •). 
Further use of proper points will be made in Section 2.2 We can already 


note that K must have no improper points if we want a positive lower bound 
for the f r on K. 

Our result holds for self-similar subsets E of dK satisfying the following 
assumption: 


Assumption 1. A self-similar subset E of dK satisfies the strong open set 
condition with some set U such that U fl dK C E and U \ dK has finitely 
many connected components. 

This assumption can be justified heuristically: if E cuts its neighbourhood 
into infinitely many connected components, then because of self-similarity it 
also does so locally, and K and K c are too disconnected to contain the 
balls mentioned in Theorem [lj Example [2] will show that these concerns are 
legitimate. 


Theorem 1. Let K be a non-empty compact set with no improper points 
and Vol(dK) = 0. Let E be a self-similar subset of dK for which Assumption 
[I ^ holds. Then there are constants 5, £ > 0 such that, for all r < S,x G E, 
both B(x, r) fl K c and B(x, r) fl K contain a ball of radius er. 

Proof. Let 0* be the generating similarities of E, A* their ratios, 0 : A i-> 
(J <pi(A) the associated set transformation. Denote by Vj the connected com¬ 
ponents of U\E. Since there are finitely many of them, we can suppose they 
all contain a ball of radius r > 0. Fix any 0 < r < 1 and x € E. 


Lemma[2]shows that there is a similarity <F with ratio A such that min(Aj)r ^ 
A < r and x e $(E). It follows that $(£/”) C B(x,r). We also have 
<F(f/) fl dK = $(U) fl E = $([/ fl E). Indeed, for any point x' of E outside 
<&(E) there is another similarity <&' of Lemma [2] such that x' G &(E) and 
&(U) n $(U) = 0, which implies cl($'(t/)) D $(U) = $'(cl(C/)) n $(U) = 
&(E) n $(U) = 0 so that x' 0 $(U). 
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Consequently, for all j, <F(Vj) has no intersection with dK. So ^(Vj) D 
int(iC) and $(Vj) fl K c are two disjoint open set sets who cover ^(Vj), and 
we must have either C K or $(Vj) C K c . 

Since there is a point y in <&([/) HE and K has no improper points, we must 
have Vol(/i flC), Vol(/i c nf/) > 0. Because Vol (E) = 0, this can only happen 
if one of the <J>(Vj) is included in K c and another in K. Hence B(x,r ) fl K, 
B(x,r ) fl K c each contain a ball of radius At. Since A ^ min(A ? ;)r, the 
conclusion of the theorem holds with £ = min(Aj)r. □ 

Remark 2. The theorem implies that K. K c have lower density bounds on 
E. More precisely, for appropriate S, £ > 0 

VxeE,r<5, f r (x),g r (x) ^ e. (4) 

This weaker statement is enough for our purposes regarding Voronoi approx¬ 
imation. 


We show below that the Von Koch flake provides a concrete example of an 
irregular set satisfying the hypotheses of Theorem [lj 

Example 1. Let E be the self-similar set associated with the direct simi¬ 
larities (f>i : R 2 —» M 2 sending S = AqA^ to a; = for i = 1,2, 3,4, 

in the configuration of Figure p~Tj Such sets E are called Von Koch curves. 
Looking at the iterates ^ n \S) in Figure [L2] gives an idea of the general form 
of the Von Koch curve and of why it is said to be self-similar. 

Note that the ^ n \S ) are curves, i.e the images of continuous mappings : 
[0,1] —>• M 2 . The y n can be chosen to be a Cauchy sequence for the uniform 
distance between curves in M 2 . Hence their limit 7 is also a continuous 
mapping, 7 Q 0 ,1]) is compact and has distance 0 with E in the Hausdorff 
metric, so 7 QO, 1 ]) — E which proves that the Van Koch curve is, indeed, a 
curve. It can also be shown to be a non-intersecting curve (the image of an 
injective continuous mapping from [ 0 , 1 ] into R 2 ). 


With a similar reasoning, if we stick three Von Koch curves of same size 
as in Figure |1.3 we get a closed non-intersecting curve C. Jordan’s curve 
theorem says R 2 \ C has exactly two connected components who both have 
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Figure 1.1: The generating similitudes of the Von Koch curve. Z 2 is the 
center of the similarity </> 2 - 



Figure 1.2: The sets (S), (S). 

C as topological boundary. The closure K of the bounded component is a 
compact set with no improper points satisfying dK = C. K is called a Von 
Koch flake. 



Figure 1.3: The boundary of the Von Koch flake K. 
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Now, construct kites Ci, C 2 , C 3 on each of the Von Koch curves , E 2 , E 3 
making dK as in Figure |1.4| It is easy to see that as long as the two equal 
angles of the lower triangle are flat enough, Ci fl dK = C t fl E % . Further¬ 
more, applying Jordan’s curve theorem to the E t and the two upper (resp. 
lower) edges of the corresponding C* shows that the Q \ Ei have exactly two 
connected components. The strong open set condition is also satisfied, so 
Theorem [l] can be applied three times to obtain lower bounds for f r and g r 
on dK. 



Figure 1.4: 


Assumption 


a 


is satisfied with the kite C. 


Theorem [l] is only concerned with the behaviour of f r and g r on dK, 
whereas standardness assumption require lower bounds on all of K and K c 
respectively. The following lemma takes care of this issue. 

Lemma 3. If for all r < 6 we have f r ^£on dK, then for all r < 6 we have 
f r ^ 2~ d e on K. The same result holds if f r is replaced by g r and K by K c . 

Proof. If x is in dK r / 2 then B(x, r) contains a ball of radius r /2 centered on 
x' G dK, so f r (x) = Vol(A' fl B(x, r))K d 1 r~ d ^ Vol(A' fl B(x', r/2))n~ L 1 r~ d ^ 
£2~ d . If x is in K \ dK r / 2 then the ball B(x,r/2) is contained in K so that 
f r (x) ^ 2~ d ^ £2~ d . So in all cases, if x G K then f r (x ) ^ £2~ d . Replacing 
K by K c gives the result regarding g r . □ 

2 Voronoi approximation 

In this section, x is a locally finite point process , and n ^ 1. If X — Xn — 
{Xi, X 2 ,..., X n }, where the X, are iid random points uniformly distributed 
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over [0, l] d , we speak of binomial input; if x — x'\ is a homogenous Poisson 
point process of intensity A > 0 we speak of Poisson input. 

Define the Voronoi cell v x (x) of nucleus x with respect to x as the closed 
set of points closer to x than to x 

v x (x) = iv e Rd : Vx ' e X,d(x,y) ^ d{x',y)}. 

The Voronoi approximation K x of K is the closed set of all points which 
are closer to K fl x than to K c D X- Its name comes from the relation 

K x = U v x( x )- 
K 

The volume <p(x) — Vol(A x ) hrst arisecl in [16] as discriminating statistics 
in the two-sample problem. These authors proved a strong law of large 
numbers in dimension 1 for the volume approximation. Explicit rates of 
convergence in higher dimensions were obtained by Reitzner and Heveling 
[13], who proved that if K is convex and compact and x — X\ then 

E^( X ) = Vol(iP), 

Var(^( X )) ^ CX-^SiK), 


where S(K) is the surface area of K, all constants can be made explicit and 
depend only on d. They also studied the quantity <pp er (x) = Vol(A'A/i x ) to 
estimate the perimeter, after suitable renormalisation. Reitzner, Spodarev 
and Zaporozhets |2DJ extended these results to sets with finite variational 
perimeter, and also gave upper bounds for E| <p(x'\) q ~ Vol(A') 9 | for q > 1. 
Schulte [23j obtained a matching lower bound for the variance with convex 
K, i.e. cS(K)\~ 1 ~ 1 / d < Var(<p(x)), and derived the corresponding CLT 

¥>(x) -Eip(x) (d) N 

y/Vai((p( X )) 

Very recently, Yukich [26J gave quantitative Berry-Esseen bounds for this 
CLT similar to the ones that are stated here for binomial input. 

When dealing with binomial input, which has been less studied than 
Poisson input, it is necessary to assume that K c (0, l) d and redefine K x as 

K x = U v x (x) n [o, if. 

xGx^K 
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in order to avoid trivial complications dne to possibly infinite cells. Penrose 
[19] proved the remarkable fact that for x = Xn 

E^(x) ^ Vol(AT), 

E(<^Per(x)) —> 0 , 

almost surely, with no need for assumptions on K's shape. 

To further assess the quality of the approximation with binomial input, 
we must quantify the previous convergence. The unbiasedness of the Poisson 
case does not occur with binomial input, mainly because of edge effects. 
Nevertheless those effects seem to decrease exponentially with the distance, 
like is customary for Voronoi cells. The following result shows that the bias of 
the estimator <p(Xn) decreases geometrically with n, therefore it is negligible 
with respect to the standard deviation, as shown in the following sections. 
Also, it still holds when (0, l) d is replaced by an arbitrary set U containing 
K in its interior. 

Theorem 2. Assume that K is a compact set with positive volume and let 
U be an open set containing K. Let Xn — {Aj, 1 ^ i n} be iid uniform 
variables on U. Then there is a constant 0 < c < 1 depending only on K 
and d such that for n ^ 1, 

|EVol(A x J-Vol(A)Kc". 

Proof. Let x.k = {Xi, i ^ k}. By homogeneity of the problem we can suppose 
Vol(U) = 1. The Voronoi approximation K Xn of K satisfies 

n 

E(Vol(KJ) = £ E (l* 6 *Vol(t>*.pt ( ) n v)) 

i=l 

= wE(lx„ G A-Vol(u Xll _ 1 (A n )ni7)) 

— n EVol(i ; Xn _ 1 (x) fl U)dx. (5) 

J K 
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Take 0 < r < \d(K , U c ). We have for all x G K 

ElVol^.^x) HU)) = E( [ l yeVxn _ i(x) )dy 


= E( 


'u 


^-B(y,\\y-x\\)nxn-l=<ll)dy 


= P(B(y,\\y-x\\)nXn-i = ^)dy 


'U 


= (l-VoliBiy^ly-xlDnU^dy 


'u 


' B{x,r ) 


(1 - K d \\y - x\\ d ) n 1 dy + c, 


where 


c n = (1 - Vol (B(y, ||y - x||) n U^dy. 

J U\B(x,r) 

For y G U\B(x,r), let B y be the ball interiorly tangent to B(y, ||r/ — x||) with 
center on [x,y] and radius r. We have B y C B(y, \\y — x||) by construction 
and By C U because B y C B(x, 2r). It follows that 


c n ^ 


/ (1-Vol (B y )) n ~ 1 dy= / (1 - < c£ 

’ U\B(x,r) JU\B(x,r ) 


for some 0 < c 0 < 1, noticing that Kd‘r d < Vol(U) ^ 1 because B(x,r ) C U. 
From there, a polar change of coordinates yields 


E(Vol(n Xn _ 1 (x) D U)) = / dKdt d 1 (1 — Kdt d ) n 1 dt + c n (because a d-sphere has surface dud) 

Jo 

(1 -K d t d ) n ^ r 


n 


+ Cn 


= - + 0(c n 


n 


for some c G (0,1). Reporting in (|5]) yields the result. 


□' 


Recalling that the estimator is unbiased if the underlying sample is Pois¬ 
son in this pleads in favor of Voronoi approximation against other estima¬ 
tors mm where the bias is not known and does not seem to be negligeable. 
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2.1 Asymptotic normality 

This subsection is concerned with the results of Ed, where it is shown that 
with binomial input, the volume approximation K x is asymptotically normal 
when the number of points tends to oo. Variance asymptotics and upper 
bounds on the speed of convergence for the Kolmogorov distance are also 
given. 

We begin by stating the boundary regularity condition necessary for these 
results to hold, which is related to the boundary densities studied in the pre¬ 
vious section. As explained in the introduction, it can be seen as a weakened 
form of the standardness assumption. Define, for all r > 0, the boundary 
neighbourhoods 


dK r = dK + B(0, r), 
dK~ = ()K r n I<, 

8K+ = ()K r n K c . 

Assumption 2 (Boundary permeability condition). A set K with no im¬ 
proper points satisfies the boundary permeability condition whenever 

lu ?>s if wnk) {L fHx) dx + L £{x) dx ) > °- (6) 

The following proposition gives a more meaningful equivalent for Assump¬ 
tion [21 


Proposition 2. Assumption^ holds if and only if 

1 f 


lim inf 


r >o Vol {dK r )n d r d J KxK c 


\x-y Sr 


dxdy > 0. 


(7) 


Proof. Let us begin by establishing the relation between the expression of 
(JT]) and K' s boundary densities. By Fubini’s theorem 


11 Irr— 


Ikxk c K d,r 


dxdy= [ Vol ( g V-y) n P) 


k Vol(B(x,r)) 
r Vo\(B(x, r) n K) 
t K c Vol(B(avr)) 


dx 


dx , 
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which rewrites simply as 



1 

K d r d 



a;—y||^r dx dy 



by definition of boundary densities. 


( 8 ) 


Consider the function h r = Ik hr +1 K c fr- We have 0 ^ h r ^ 1 and h r = 0 
outside of dK r . Applying the Cauchy-Schwarz inequality gives 


h 2 r V li r ^ y/Vo\(dK r )J / h 2 r 


which rewrites as 

1 / 


Vol(dK r ) J dKt 


fr Z + 


hi A 


I dKr 


w nor-' \ 1 til l||x—i/||<r dx dy J 

Vol (dK r ) \Kdr d J KxK c J 


< 


Vol (dK r ) \J dK f f ' + L- 


hi ,, 


so that clearly © and ([T]) are equivalent. D 

Remark 1. If K is bi-standard with constant e then ([6]) holds as well with the 
left hand being greater than e 1 . Hence bi-standarness implies the boundary 
permeability condition. 


Remark 2. Note that 


Vo1(<9a; + ) ^ 


i 

K d r d 



l||x-j/|[<r dx dy , 


so that Vo\(dK +) <C Vol (dK r ) prevents ([7]) from being satisfied. Of course, 
the same reasoning holds with dK~ instead. In other words, it is necessary 
for the boundary permeability condition to be fulfilled that both sides of the 
boundary have comparable volumes. 


We reproduce below the result derived in [HI Th. 6.1] for Voronoi ap¬ 
proximation, modified to measure the distance to the normal of the variable 
Vol( K Xn ) — Vol(iV), instead of Vol(K Xn ) — EVol(iC Xn ) like in the original 
result. This subtlety is important for dealing with practical applications 
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and obtaining confidence intervals for Vol(Ji). We deal with Kolmogorov 
distance, also adapted to confidence intervals, and defined by 


d K (U,V) :=sup|P(K ^f)-P(K^f)|, 
te K 

for any random variables U, V. 

Theorem 3. Let K be a compact subset of (0, l) d . Assume that for some 
s < d 


0 < liminfr s “Vol (dK r ) ^ limsupr s “Vol (dK r ) < oo, (9) 

r >0 r>0 

and that K satisfies the boundary permeability condition (Assumption [2|). 
Then 

, Var(Vol(A\. )) Var(Vol(A" )) 

0 < liminf- , , Xn — < limsnp- , Xn — < oo, ( 10 ) 

r >0 n~ 2+s / d r>0 n~ 2+s / d 

and for all e > 0 there is C e > 0 such that for all n ^ 1 
( VoK/VJ - VoI(A') 


d, 


•K 


V yVar(Vol(A x J) 
where N is a standard Gaussian variable. 


,JV ^ C £ n~ s/2d log(n) 4 - s/d+£ , 


( 11 ) 


Proof. This result is almost exactly m Th. 6.1], except that there it is 
proved that 


VoH Ay) EVol(A x „) \ ^ ( ; n 

yVartVolf/VJ) ) 


( 12 ) 


To have a similar bound involving Vol (AT) instead of E<p(Vol(A” Xn )), let us 
first remark that for del, a random variable U , and V = U + 5, 

d K (V, N ) < d K {V, N + 5) + d K (N + 5,N)^ d K {U, N) + (2tt)- 1 / 2 |A|, 


19 








since djc(V, N + S) = djc(U, N ). It follows that 


die 


( Vol(A x g - Vol(A) \ 
( \/Var(Vol(A-,J) ’ ) 




( V°1(A\J - EVol(AJ \ 
f gVar (Vol(A' Xll )) ’ ) 


+ (2ir)- 1 '' 2 


EVol(A' Xu ) - Vol(A) 
VVar(Vol(A x J) 


( Vol(A- v J - EVol(A- t „) N \ ___ 0(g) 
f v/Var (Vol(A v J) ’ / 


for some c G (0,1) by Theorem [2} Reporting the bounds of (12) yields ©• 

□ 


Remark 3. The fact that [0, l] d is the support of the random sampling 
variables does not seem to have a great importance. Uniformity over [0, l] rf 
eases certain estimates in the proof of [13 Th. 6.1] related to stationarity, 
but is not essential. If the variables are only assumed to have a positive 
continuous density k(x) > 0 on an open neighborhood of <9A, it should be 
enough for similar results to hold. See Theorem [2j or [19], for rigourous 
results in this direction. 

Remark 4. If K satisfies all the hypotheses of Theorem [3] except the bound¬ 
ary permeability condition, then we have 


sup 

te r 


P 


( Vol(A x J - EVol(AJ 


j _ \ A. TL / _ \ / 

V \/Var(Vol(/0 


^ t - P(1V ^ t) 


<: C £ n £ {a- 2 n~ 2+s / 2d + a- 3 n~ 3+s/d + a~ 4 n~ 4+s/d ) 

(13) 


where cr 2 is the variance of Vol(A" Xn ). See [T7I Th. 6.2] for more details. 


Remark 5. Set-estimation literature is also concerned with perimeter ap¬ 
proximation m Sec. 11.2.1], In the context of Voronoi approximation, the 
study of the functional Vol(A Xn AA) has been initiated in [13:, 20]. Although 
the result is not formally stated, a bound of the form (13) for this functional 
is available using the exact same method. One has to work separately to 
obtain a variance lower bound. Such a result with Poisson input has been 
derived very recently in the paper of Yukich 


20 



















Results regarding the volume of the symmetric difference between the set 
and its approximation can be used to compare Voronoi approximation with 
another estimator. Indeed, the bound in nT 1//d given in [L3] for EVol(A' Xn AK ) 
is better than the bound in (nr^) -1//2 of [4], who use the Devroye-Wise esti¬ 
mator with a smoothing parameter r n <C n _1 ^ d+1 ). 

The consequences of Theorems [T| and [3] for sets K with self-similar bound¬ 
ary are immediate, condition (|9]) automatically holds by Proposition [I] 


Corollary 1. Let K be a compact set such that c)K is a finite union of 
self-similar sets satisfying Assumption^ Then (10) and (11) hold. 

This corollary applies to the Von Koch flake with s = ln(4)/ln(3) (Ex¬ 
ample [T|) . The conclusions of Theorem [3] also apply for instance to the Von 
Koch anti flake, where three Von Koch curves are sticked together like for 
building the flake, but here the curves are pointing inwards, and not out¬ 
wards (Figure |2T ). Assumption fi] is not satisfied on the whole boundary, 
but it is within an open ball of M'Mntersecting one and only one of the three 
curves, and having Q on a self-similar E with same Minkowski dimension 
as dK is actually enough for the boundary permeability condition to hold. 



Figure 2.1: The Von Koch antiflake 


In Sect ion [273] we exhibit an example of a set K such that c)K is self-similar 
and K does not satisfy Assumption [l] We run simulations suggesting that 
(10) is also false. Our theorem gives a set of sufficient conditions, but other 
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versions should be valid. For instance, the question of whether a compact 
set K C M 2 whose boundary is a locally self-similar Jordan curve satisfies 
the conclusions of the theorem above seems to be of interest. 


2.2 Convergence for the Hausdorff distance 

In this subsection we will make use of r-coverings and r-packings. Consider 
a collection B of balls having radius r and centers belonging to some set 
E C M d . B is said to be an r-packing of E if the balls are disjoint. It is an 
r-covering if the balls cover E. 

The size of minimal coverings and maximal packings is closely related to 
the Minkowski dimension of E. A necessary and sufficient condition for E 
to have upper and lower Minkowski contents is that, for all small enough r, 
we can find an r-covering of E with less than Cr~ s balls, and an r-packing 
of the same set with more than cr~ s balls. More related results can be found 
in 


To estimate with precision r the shape of a set A by a point process x if 
is often necessary to request that every point of E is at distance less than r 
of x- In the context of Voronoi approximation, this is made precise by the 
following lemma. Note that we only require x to be dense enough near dA. 
This is, as suggested in the introduction, because Voronoi approximation fills 
in the interior regions of K where points of x are scarce. 

Lemma 4. Let x C M. d be a locally finite non-empty set. 

1. If every point x of cl (dKf ~) satisfies d(x, x) < r then K x C K + B(Q,r). 

2. If every point x of cl (dKff ) satisfies d(x, x) < r and every point x of 
cl (dK~) satisfies d(x, x hi K) < r then d„(K,K x )^r. 

3. If every point x of cl (dKff ) satisfies d(x, x hi K c ) < r and every point 
x ofcl(dK-) satishes d(x, x hi K) < r then dn(dK , dK x ) ^ r. 

4. If some point x G dK satishes d(x, x hi K) ^ 3r and d(x, x hi K c ) ^ r 
then dn{K,Kx) ^ r anc ^ dn(dK,dK x ) ^ r . 

Proof. We begin with the first point. Suppose x G K x satishes d(x, K ) ^ r. 
Then there is a point c x G x hi K such that x G v x (c x ). The segment 
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joining c x and x contains points from dK so we can consider Xq the point of 
cl (dK+) closest to x on that segment. We have d(xo,dK ) = r and xq G K c 
since otherwise there would be another point of c\(dK+) closer to x. As a 
consequence d(x 0 , c x ) ^ r. But then by assumption there is a point y of x 
such that d(x 0 ,y ) < r and c x isn’t the point of x closest to x, which is a 
contradiction. Hence x G K x implies d(x,K) < r and K x C K + 5(0, r). 

Note that in the setting of points 2 and 3 we can apply the previous 
argument to K c instead of K, the compacity of K not playing any role in 
the proof. Along with (K x ) c = (. K c ) x this yields K x C K c + 5(0, r), which 
reformulates as K \ cl(dK~) C K x by taking complements. Hence in both 
cases we have the inclusions 

K x C K + 5(0, r), K c x C K c + 5(0, r), 
and their reformulations 

K \ cl (dK;) C K x , K c \ cl(dK+) C K{. 

To prove the second point it is enough to show that K C K x + 5(0, r). 
Let x be a point of K. If x G K \ cl (pK~) then x belongs to K x . And if x 
is in cl (pK~) then there is a point y of y fl K such that d(x,y) < r. In all 
cases x G K x + 5(0, r). 

We move on to point 3. The two inclusions K c \ cl (dK r ) C K x and 
K \ cl (dK r ) C K x also show that if x satisfies d(x, dK) > r, x is interior to 
either K x or K x . Hence dK x C dK + cl(5(0, r)). Conversely, for every point 
x of dK there are points of both x H K and x H K c inside B(x, r), so B(x, r) 
contains a point of dK x . Hence dK C dK x + 5(0, r) and dn(dK, dK x ) ^ r. 

Lastly, suppose the requirements of point 4 are met. Let y be a point of 
X ft B(x, r ) fl K c . Then all of the points in B(x , r) are closer to y than to the 
points outside of B(x, 3r). Consequently all points B(x, r ) must lie in Voronoi 
cells centered in K c , and x ^ K x + 5(0, r) so that dn(K, K x ) ^ r. The fact 
that B(x,r ) C K' x also implies d(x,dK x ) ^ r and dH(dK,dK x ) ^ r. □ 
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Now we apply this lemma to show almost sure convergence of K x in the 
sense of the Hausdorff distance. To formulate such a result, the concept 
of proper points (beginning of Section 1.3) proves to be useful. Improper 
points are invisible to the Voronoi approximation K x of K. Though this has 
no incidence when measuring volumes, it becomes a nuisance when measuring 
Hausdorff distances. 

The set prop of points proper to K can be thought of as the complement 
of the biggest open set O such that Vol(0 D AT) = 0, from which it follows 
that A' prop is compact and that K x = A' prop a.s. 


Proposition 3. AT — y A' prop and <9AT — y f)K pmp almost surely in 

n n—>-+oo n n—>•+oo 

the sense of the Hausdorff metric for both Poisson and binomial input. 

Proof. Since K x = (A" prop ) x almost surely and A' prop has no improper points, 
this is equivalent to the fact that K Xn —y K and dK Xn —y dK almost surely 
when K has no improper points. By the Borel-Cantelli lemma it is enough 
to show that both series 

V P (d„(K Xn , K) > r), V p(d H (dK Xn ,dK) > r) 

n> 1 n^l 

are convergent for any positive r. 

Consider r /2-coverings B + ,B~ of cl(chA+), c\(dK~) respectively. Since 
both sets are compact, these coverings can be made with finitely many balls. 
Set B = B + U B~ and 

V = min ( min Vol(A IT AT, min Vol(T> IT K c ) 

\B£B- B&B+ 

Because K and K c have no improper points, V > 0. If every ball of B + 
contains a point of x C K c and every ball of a point of x H K, then the 
requirements of points 2 and 3 in Lemma [4] are met. The probability of this 
not happening is bounded by \B\{1 — V) n for binomial input and \B\e~ nl for 
Poisson input. In all cases the series associated with P(d H (K Xn , K) > r) and 
P{(].h{ dK Xn ,dK) > r ) converge, as required. □ 

A refinement of the method above gives an order of magnitude for dn(K, K x ) 
with Poisson input, under assumptions on dK, f r and g r resembling those of 
Theorem [3] This requires better estimations of the probability of the points 
of Lemma [5] being met, which is the purpose of the following lemma. 
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Lemma 5. Let A, x be non-empty sets, and B a collection of balls centered 
on A with radii r. Write B r for the collection of balls having same centers 
as those of B but radius rr, and choose Ti, 72 > 0 such that T\ + T 2 = 1. If 
B ri is a Tir-covering of A and every ball of B T2 contains a point of then 
A C X + B(0,r). 

Proof. Let a; be a point of A. By hypothesis, there is a ball of B with center 
c such that d(x, c ) < rpr, and also a point y of x such that d(y, c ) < r 2 r. 
ffence d(x,y ) < r(r1 + T2) and d(x,x) < t. So indeed every point of A is at 
distance less than r of x- Cl 


This handy lemma is meant to give probability estimations of events of 
the type A C x + -8(0, r), which are useful outside the context of Voronoi 
approximation. Typically, x is chosen to be a random point process, and the 
covering B T] is chosen deterministically with as few balls as possible, often 
CTf d r~ d . Bounding the probability that a ball of B T2 does not intersect x 
then gives an upper bound of the form 


P (A ^ x + B(0, r)) < \B\ max P (B fl y = 0). 


The estimations obtained in such applications are less sensible to the number 
of balls in B than to their size, ffence optimal results are obtained when T\ 
is small. 

For example, the reader may use Lemma [5] to derive [9j Th. 1] and 
its counterpart for Poisson input, which are concerned with the order of 
magnitude of d H (K,K fly) with x an homogenous point process. Note 
that use of Minkowski contents and boundary densities give slighlty better 


bounds, which turn out to be optimal, see Remark 10 


Theorem 4. Suppose that dK has Minkowski dimension s > 0 with upper 
and lower contents, and that for all r small enough, 

f r ^£ on K, 
g r ^ e on K c . 


Then we have 


P 

P 



d H (K,K A ) 

(A - 1 ln(A)) 1 /‘ J ^ 1 


A —y 00 


f 



d H (dK,dK x , x ) ^ \ 
(A -1 ln(A)) 1 / d 


X —^00 


1 
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where x'\ is a Poisson point process of intensity A and a, (3 satisfy a < cxk, (3 > 
/3k with 

- 1 ( s v /d 

aK 3 \dK d (l - e)) 



Proof. The approach of the proof is to tune r in Lemma [4] in order to have 
the events of points 3 happen with high probability. We shall only show the 
assertions regarding dn(dK, dK x ), since the exact same arguments hold with 
dn(K,K x ) as well. 

We start with the upper bound. For all A let Da be the event where 
all the requirements from point 3 of Lemma [4] are met with X — x!\i r — 
r\ = /3(A —1 ln(A)) 1//rf . Hence {dn(dK, dK x ) > r} C Bl c x . We shall show that 

P(n*)->o. 

Choose Ti,t 2 < 1 so that Ti + t 2 = 1 and t 2 /3 > (3k■ Let B + be a collection 
of balls with radius r and centers on cl(<9/l r + ). As in Lemma |5j call Bf the 
collection of balls with same centers as those of B + , but radius rr. Define 
B~, Bf similarity and set B = B + U B~. Note that B depends on A, but Ti, t 2 
do not. 

We can and do choose B + ,B~ so that £>+,£>“ are coverings of cl (dKf) 
and cl {dK~) respectively, and \B\ has less than Crf d r~ s = Crf~ d (X/In \)~ s / d 
balls. Indeed, consider ryr/ 2-packings of dKf and dK~ , both optimal in the 
sense that no ball can be added without losing the packing property. Because 
of volume issues, the packings have less than Crf d r~ s balls, and because of 
the optimality assumption doubling the radii of the balls gives the desired 
Tir-coverings. 

The intersection of K with a ball B G Bf of center x has volume exactly 
Krf(r 2 r) d / T2r (a;). Because for large enough A and r 2( d > (3k it follows 

that 

P(B n x n K = 0) ^ exp (—A T$£K d r d ) = \~ s / d ~ s 

for some 5 > 0. The same bound is valid for P(B D x C K c = 0), B G Bf 2 . 

Applying Lemma [5] twice with A = c)Kf , c)Ky successively gives 

P(ft£) < \B\\~ s/d ~ s < Crf d ln(A) s/d A- 5 
so that, since T\ is fixed. P(D^) —> 0 as desired. 
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The proof for the lower bound is quite similar. Fix 5 > 0, and redefine fl;*, 
to be the event where the requirements described in point 4 of Lemma [4] are 
met for y = r — r\ — a(ln(A)A _1 ) 1//d with a < ax- Again, we shall show 
P(f^) —> 0. Let B = B\ be a 3r-packing of dK. We can assume \B\ ^ cr~ s . 

The probability of there being no points of K fl in a ball B(x, 3r) of 
B and at least one point of K c fl y in B(x,r) for a point x in the boundary 
is exactly 

exp(-A«; d (l - g 3r (x))3 d r d ) (l - exp(-\K d g r (x)r d )) 

because B(x, 3r\)nK c and B(x, r)DK are disjoint. So we have the following 
upper bound, for A big enough 

P(fi £ ) < (1 - e" AKd(1 ~ e)3<V (l - e - AKdrd£ )) |B| . 

We would like the right hand to go to 0 with A. Taking logarithms this is 
equivalent to 

\B\ exp(— \Kd(l — e)3 d r d )(l — exp(—A Kdr d e)) —> +cx). 

A—»+oo 

Because exp(—\K d (l — e)3 d r d ) = \ 5 ~ s / d with S > 0, exp(—A Kdr d e) —* 0 and 
\B\ ^ c(A/ln(A)) s ' /(i , it is indeed the case. 

□ 

The proof and the result call for some comments. Most of them are minor 
variants on the result which were not included in the proof for clarity’s sake. 

Remark 6. It is possible to dispose of the hypothesis that dK has Minkowski 
upper and lower contents, by using instead the so-called upper and lower 
Minkowski dimension, which always exist, see [IB]. In particular, we can 
always do the coverings in the proof with Cr~ d balls, so the upper bound 
still holds after replacing s by d in the expression of /3k- This compares 
with the result given by Calka and Chenavier in [5J, Corollary 2], One can 
also show, using the fact that K is bounded and has positive volume, that 
T-L d ~ l {dK ) > 0 so that s can be replaced by d — 1 in the expression of ax- 
Hence a lower bound also holds with no assumption on dK 's geometry when 
d^2. 

For the results concerned with dx/dK, dK x ), this is a remarkable feature 
that to our knowledge no other estimators possess. For instance, in [9j a 
so-called expandability condition is required to obtain similar rates with the 
Devroye-Wise estimator. 


27 


Remark 7. If s = 0 and dK has Minkowski contents then actually d — 1, 
dK has a finite number of points, and dn(K, K x ) has order A -1 in the sense 
that for A large enough 

P{d H (K,K x ' x )X >t)^ 2\dK |exp(—2et), 

which is enough to guarantee the existence of moments of all orders for 
d H (K,K x ) A. This is not true of other shape estimators, and is due to the 
fact that Voronoi approximation only requires x to be dense near dK and 
not on all of K. If we don’t have Minkowski contents the situation might be 
more delicate. 


Remark 8. Better estimations of the P(f^) in the proof along with an 
application of the Borel-Cantclli lemma yield the almost sure convergence 
rates advertised in the introduction. Explicitly 


olk ^ lim inf 

n—>-+oo 


(tt - 1 ln(n)) 1 / d 


^ lim sup 

n —>-+oo 


Mk,k x q 

(tt , -1 \n(n)y/ d 



and similarity for dn(dK, dK x ), with ax,/3 k as in Theorem [4] and d’ K = 

(Pi + (1 lK d e)y/ d . 


Remark 9. For binomial input, some minor changes in the proof give the 
same upper bound. It can’t be done for the lower bound since we use the 
fact that x n A, x fl B are independent when A and B are disjoint and x 1S 
a Poisson point process. 


Remark 10. Using similar techniques as in the proof above it is possible to 
show that 

d H (K,Knx' x ) p ( 2(d-l) \ 1/d 
(A -1 ln(A ))d d V dK d ) 

if K has no improper points and dK is a C 2 manifold. Theorem [I] shows 
that, under the same assumptions, the above limit can be used as an upper 
bound for dn(K, K Xx ){\/ ln(A)) -1//d . Hence, as a shape estimator, K x is not 
worse than x H K- It would be interesting to know if it is better in some 
sense, a question related to the optimality of the bounds in Theorem [4j 


Remark 11. Applying point 2 of Lemma [4] instead of point 3 in the proof 
of the theorem yields a better result for dn(K, K x ). Specifically if f r > £ f 
on K then 


P 


/ d„(K,K x . k ) 
V(A-i hi(A))!/ d 



A—^oo 


l 







whenever 




s 

dtxd^ f 


1/d 


Together with Remark [6] this shows that inner standardness is a sufficient 
assumption to have convergence rates for d H (K,K x ). 


2.3 A counter-example 

Here we construct a set K cantoi with self-similar boundary not satisfying 
the boundary permeability condition. This example shows that Theorem [l] 
cannot be generalised by dropping Assumption [lj even if the conclusion is 
weakened. 

The example K below is uni-dimensional, but a counter-example in di¬ 
mension 2 can be obtained by considering K x [0,1]. 

Example 2. Let E C M the self-similar set generated by the similarities 
0i : i 4 i/3, 02 : x i—>• (2 + x)/3 who satisfy the open set condition with 
U = (0,1). E is in fact the Cantor set, and can be characterized as the set 
of points having a ternary expansion with no ones. 

A'cantor will be defined as the closure of open intervals of [0,1] \ E. The 
trick is to choose few intervals with quickly decreasing length, so that f r 
is small on most of /Cantor’s boundary, but to distribute them well so that 

<9 A cantor A. 


To every positive integer n associate the sequence s' n of its digits in base 2 
in reverse order and double the terms to get s n . For example, since 6 is 110 
in base 2, s 6 = (0,2,2). This defines a bijection between N and the set of 
finite sequences of zeroes and twos ending in 2, with the additional property 
that s n always has length l n C n. Now for all n define 




b 

A 


n 


n 


1 

3 n+1 




2 

3 n+1 




(ci n , b n ) 


3 k 

Q n 

3 fc 
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We have the following ternary expansions 


0.s"4--<000...01 
O.s^.-.s^OOO...0022222... 
0.sX-s"„000...02 


Now, set K = cl(U^„). We claim that K has no improper points, dK = E 
and that K does not satisfy the regularity condition of Theorem [3j 

Proof. The hrst assertion is easy to prove. Being segments, the A n have no 
improper points to themselves, so IJ A n C K prop and K C K prop by taking 
closures. 

For the second assertion we need to show that dK = K \[J A n = cl((J{a n , b n }). 
We already have the obvious dK C K \ [J A n . Define 




Since for all n, sf n = 2, the corresponding ternary expansions are 



= O.sX-s^-iOOO... 0022222... 

b' n = 0. S ^... S r n _ 1 000...02 


If x G Ai fl A'j then every ternary expansion of x has the same digits as the fi¬ 
nite ternary expansions of Oj, a'- up to the hrst 1, which is impossible. So |J A' n 
is an open set disjoint from (J A n and hence from K. Furthermore, LX is 
dense near the a n , because for all k, N e N*, we can hnd an a' k , whose ternary 
expansion has the same N hrst digits as the non-terminating expansion of 
a*;, so that d(ak,a' k ,) ^ 1/3^. A similar argument works for the b n , so that 
the a n , b n belong to dK and, since the latter is closed, cl([J{a n , b n }) C dK. 


30 




Finally, consider a point x G K \ (J A n . For all r > 0, B(x,r ) contains a 
point from an Ak, and since x ^ Ak, one of the two points (ik, bk must also 
be in B(x,r). Consequently, x is also an accumulation point of 
We just proved that K \ [J A n C cl(|J{a n , b n }). Putting this together with 
the previous two inclusions we get the desired equality. 

Since for all x G E, N G FT* we can find an a*, with the same first N digits 
as x in base 3, the a n are dense in E and E C dK. Conversely, dK C E, 
since the a n , b n belong to E, who is closed. 

For the last assertion, pick any r > 0 and set N = 2|~— log 3 (r)]. Let X 
be the union of the balls of radius r centered on the endpoints of the N first 
A n . X has area at most —4rlog 3 (r) and for any x G dK r \ X, B{x,r ) does 
not intersect the Ak,k ^ N. Since Xol(dK) = 0 

Vol(A- \ (A, U A 2 .. . U A„)) = Vol( |J A n ) = < r 2 . 

n>N 

But Vol (dK r ) has order j- 1 - 111 ! 2 )/ 111 ^) and 

Xo\{dK;) < Vol(A) + Vol (A \ (A 1 U A 2 .. . U A N )) V -4r log 3 (r) + r 2 

so that Xol(dK^) -C Yo\(dK r ). According to Remark |2j this prevents (j7|) 
from holding. O 


Simulations were made for the quality of the Voronoi volume approxi¬ 
mation with this set K. The magnitude order of the empirical variance of 
Vol {K Xn ) seems to be n T with r ~ —1.8, as shown in Figure 2.2 Look¬ 


ing at Theorem [3j the approximation behaves as if the set had a "nice 
fractal boundary of dimension « 0.2, whereas its real fractal dimension is 
1 -ln(2)/ln(3) » 0.37. 

Simulations also suggest that a central limit theorem still holds. Such 
a fact indicates that though the results of Lachieze-Rey and Peccati na 
seem to be generalisable, the variance of Vol (K Xn ) is indeed related to the 
behaviour of f r and g r near dK. 


Example 3. It is possible to construct other sets not satisfying the regularity 
condition of Assumption [2j If we don’t require dK to be a self-similar set, a 
much simpler example is given by 


K = 



1 

3”’ 



)■ 
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Figure 2.2: In blue ln(Var(iF Xn )) as a function of ln(n), in red the asso¬ 
ciated linear regression. For each n, the variance was estimated with 1000 
realisations of Vo\(K Xn ). 


Intentionally, dK looks like the set {n~ 1 ,n € N*}, who is often given as an 
example of a countable set with positive Minkowski dimension. K has no 
improper points, its boundary has Minkowski dimension 1/2 with upper and 
lower contents, but K does not satisfy © or (J7|). This can be proved using 
the same methods as in Example [2] Again, simulations tend to show that 
the variance of Vol {K Xn ) is about n T with r ~ —1, 8 and that a central limit 
theorem still holds. 
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